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OIPE 



RAW SEQUENCE LISTING 

PATENT APPLICATION; US/09/94 5 , 254 

Input Set : A:\Seqlist.txt 

Output Set: N:\CRF3\0 9212001\I94 5254.raw 



DATE: 09/21/2001 
TIME: 20:48:55 



C--> 
C--> 



4 <110> APPLICANT: Meyers, Rachel 

5 MacBeth, Kyle 

6 Tsai, Fong-Ying 



8 
9 
11 
13 
13 
13 
14 
16 
18 
20 
21 
22 
23 
25 
26 



ENTERED 



<120> TITLE OF INVENTION: 8797, A NOVEL HUMAN 

GALACTOSYLTRANSFERASE AND USES THEREOF 
<130> FILE REFERENCE: MNI-188 

<140> CURRENT APPLICATION NUMBER: US/09/945,254 
<141> CURRENT FILING DATE: 2001-08-31 

<150> PRIOR APPLICATION NUMBER: 60/229,829 
<151> PRIOR FILING DATE: 2.000-0.8-31 
<160> NUMBER OF SEQ ID NOS : 3 

<170> SOFTWARE: FastSEQ for Windows Version 4.0 
SEQ ID NO: 1 
LENGTH: 4052 
TYPE: DNA 

Homo sapiens 



<210> 
<211> 
<212> 
<213> 
<220> 
<221> 



ORGANISM 
FEATURE : 
NAME/KEY 



27 <222> LOCATION 
29 <400> SEQUENCE 



CDS 

(459) . . . (1592) 

1 

30 ccaagattta aagcccgcaa gttttgttct tgagaccagc gactttagct ccgatgcggg 60 

31 aaggaaagcc gacctccgat ttggacattt aaagagctgg gcttgaactt cgtgagtttc 120 

32 gctctaaact gcccttgaaa tgaagctgga cttggaggtg gcatggaata ttcacatggg 180 

33 agagccgcat gaggccgccc accacgcttc ctgaaggatg cccgtgtgga agaattttga 240 

34 cgtgccagtg tcctcgttct acagggtgtt ccattcttcc gcaatctcag aaaaatggga 300 

35 ctaaaagaaa ctattttgta aaataagaag acttccattt ttaatgacca acatgtatta 360 

36 agatggacac ctactctacg aaacacgaag ttctatggtc tcgaagaagc ccgtgcctgt 420 

37 ttaaaactga tcctaactaa aaacagactt gagtggat atg aga atg ttg gtt agt 476 

38 Met Arg Met Leu Val Ser 

39 15 

41 ggc aga aga gtc aaa aaa tgg cag tta att att cag tta ttt get act 524 

42 Gly Arg Arg Val Lys Lys Trp Gin Leu lie lie Gin Leu Phe Ala Thr 

43 10 15 20 

45 tgt ttt tta gcg age etc atg ttt ttt tgg gaa cca ate gat aat cac 572 

46 Cys Phe Leu Ala Ser Leu Met Phe Phe Trp Glu Pro lie Asp Asn His 

47 25 30 35 

49 att gtg age cat atg aag tea tat tct tac aga tac etc ata aat age 620 

50 lie Val Ser His Met Lys Ser Tyr Ser Tyr Arg Tyr Leu lie Asn Ser 

51 40 45 50 

53 tat gac ttt gtg aat gat acc ctg tct ctt aag cac acc tea gcg ggg 668 

54 Tyr Asp Phe Val Asn Asp Thr Leu Ser Leu Lys His Thr Ser Ala Gly 

55 55 60 65 70 

57 cct cgc tac caa tac ttg att aac cac aag gaa aag tgt caa get caa 716 

58 Pro Arg Tyr Gin Tyr Leu lie Asn His Lys Glu Lys Cys Gin Ala Gin 

59 75 80 85 

61 gac gtc etc ctt tta ctg ttt gta aaa act get cct gaa aac tat gat 764 

62 Asp Val Leu Leu Leu Leu Phe Val Lys Thr Ala Pro Glu Asn Tyr Asp 
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RAW SEQUENCE LISTING DATE: 09/21/2001 

PATENT APPLICATION: US/09/94 5 , 254 TIME: 20:48:55 
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129 ata att etc ctt tgt aaa att age tat gtg gac aca tac cct tgt agg 1580 

130 lie lie Leu Leu Cys Lys lie Ser Tyr Val Asp Thr Tyr Pro Cys Arg 

131 360 365 370 

133 get gcg ttt ate taatagtact tgaatgttgt atgttttcac tgtcactgag 1632 

134 Ala Ala Phe lie 

135 375 

137 tcaaacctgg atgaaaaaaa cctttaaatg ttegtctata ccctaagtaa aatgaggacg 169J 

138 aaagacaaat attttgaaag cctagtccat cagaatgttt ctttgattct agaagctgtt 1752 

139 taatatcact tatctacttc attgectaag ttcatttcaa agaatttgta tttagaaaag 1812 

140 gtttatatta ttagtgaaaa caaaactaaa gggaagttca agttctcatg taatgecaca 1872 

141 tatatacttg aggtgtagag atgttattaa gaagttttga tgttagaata attgettttg 1932 

142 gaaaatacca aatgaacgta cagtacaaca tttcaaggaa atgaatatat tgttagacca 1992 

143 ggtaagcaag tttatttttg ttaaagagca cttggtggag gtagtagggg cagggaaagg 2052 

144 tcagcatagg agagaaagtt catgaatctg gtaaaacagt ctcttgttct taagaggaga 2112 

145 tgtagaaaaa tgtgtacaat gttattataa acagacaaat cacgtcttac cacatccatg 2172 

146 tagctactgg tgttagagtc attaaaatac etttttttge atcttttttc aaagtttaat 2232 

147 gtgaactttt agaaaagtga ttaatgttgc cctaatactt tatatgtttt taatggattt 2292 

148 ttttttaagt attagaaaat gacacataac aegggcaget ggttgctcat agggtccttc 2352 

149 tctagggaga aaccattgtt aattcaaata agctgatttt aatgacgttt tcaactggtt 2412 

150 tttaaatatt caatattggt ctgtgtttaa gtttgttatt tgaatgtaat ttacatagag 2472 

151 gaatataata atggagagac ttcaaatgga aagacagaac attacaagee taatgtctcc 2532 

152 ataattttat aaaatgaaat cttagtgtct aaatccttgt actgattact aaaattaacc 2592 

153 cactcctccc caacaaggtc ttataaacca cagcactttg ttccaagttc agagttttaa 2652 

154 attgagagca ttaaacatca aagttataat atctaaaaca atttattttt catcaataac 2712 

155 tgtcagaggt gatctttatt ttctaaatat ttcaaacttg aaaacagagt aaaaaagtga 2772 

156 tagaaaagtt gccagtttgg ggttaaagca tttttaaagc tgcatgttcc ttgtaatcaa 2832 

157 agagatgtgt ctgagatcta atagagtaag ttacatttat tttacaaagc aggataaaaa 2892 

158 tgtggctata atacacacta cctcccttca ctacagaaag aactaggtgg tgtctactgc 2952 

159 tagggagatt atatgaaggc caaaataatg acttcagcaa gagtgactga actcactcta 3012 

160 aggectttga ctgeagagge acctgttagg gaaaatcaga tgtctcatat aataaggtga 3072 

161 tgteggaaac aegcaaaaca aaacgaaaaa agatttctca gtatacacaa ctgaatgatg 3132 

162 atacttacaa tttttagcag gtagcttttt aatgtttaca gaaattttaa tttttttcta 3192 

163 ttttgaaatt tgaggcttgt ttacattget tagataattt agaattttta actaatgtca 3252 

164 aaactacagt gtcaaacatt ctaggttgta gttactttca gagtagatac agggttttag 3312 

165 atcattacag tttaagtttt ctgaccaatt aaaaaaacat agagaacaaa agcatatttg 3372 

166 accaagcaac aagcttataa ttaattttta ttagttgatt gattaatgat gtattgeett 3432 

167 ttgeccatat ataccctgtg tatctatact tggaagtgtt taaggttgcc attggttgaa 3492 

168 aacataagtg tctctggcca tcaaagtgat cttgtttaca gcagtgcttt tgtgaaacaa 3552 

169 ttatttattt gctgaaagag ctcttctgaa ctgtgtcctt ttaatttttg cttagaatag 3612 

170 aatggaacaa gtttaaattt caaggaaata tgaaggcact tccttttttt ctaagaagga 3672 

171 agttgctaga tgattccttc atcacactta cttaaagtac tgagaagagt atctgtaaat 3732 

172 aaaagggttc caacctttta aaaaagaagg aaaaaacttt ttggtgctcc agtgtagggc 3792 

173 tatcttttta aaaaatgtca acaaagggaa aataaactat cagcttggat ggtcacttga 3852 

174 atagaagatg gttatacaca gtgttattgt taaaattttt ttaccttttg gttggtttgc 3912 

175 atcttttttc catattgtta attttatacc aaaatgttaa atatttgtat tacttgaatt 3972 

176 ttgctcttgt atggcaaaat aattagtgag tttaaaaaaa atctatagtt tccaataaac 4032 

177 aactgaaaaa ttaaaaaaaa 4052 

179 <210> SEQ ID NO: 2 

180 <211> LENGTH: 378 
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Aia p 


Tin 

lit! 


Php 
r 1 Itr 


lie 


n x o 


Mpt 

l Y lt: L 


-v^- >--v 

Pro 


Asn 


Leu 


T 1 A 

lie 


Pin 

\j 1 U 


lyr 


Leu 


Pin 

bin 


"i 1 A 

210 






1 Q S 
X .7 J 










9 0 0 

zuu 










T A c 

zU j 








211 


o e X 


T n i l 


Pin 
UlU 


Pin 
bX 11 


Tip 




1 

V ul 


p 1 n 

\JJ X 1 1 


Asp 


pne 


Trp 


Tin 

lie 


P 1 T T 

biy 


Arg 


v a l 


LI -1 c 
nlS 


"i i ^\ 

t-j _L ^ 




^ X \J 










Z J_ D 










O O A 
Z Z U 










2 1 J 


nJ y 


b x y 


Aid 


D to 
rlU 


P "TO 


Tip 
x x t; 


nx y 


on 
n o ^ 


Lys 


O s~\ -v* 

ber 


ber 


Lys 


Tyr 


iyr 


v a i 


O e-i t- 

ber 


214 


9 S 

Z i, J 










Z j U 










n 1 c 

2 3 5 










z 4 (J 


2 1 3 


Tyr 


Glu 


Met 


Tyr 


Pin 
bill 


i rp 


r I U 


Ala 
nla 


Tyr 


pro 


Asp 


Tyr 


T V^l -r- 

i nr 


Ala 


P 1 T T 

b iy 


Aid 

Aia 


216 










24 5 










OCA 

2 d0 










n C 
ADD 




21/ 


Ala 


Tyr 


Val 


He 


Ser 


Gly 


Asp 


Val 


Ala 


Ala 


Lys 


val 


Tyr 


(jlU 


Aia 


ber 


218 








260 










265 










270 






219 


Gin 


Thr 


Leu 


Asn 


Ser 


Ser 


Leu 


Tyr 


He 


Asp 


Asp 


Val 


Phe 


Met 


Gly 


Leu 


220 






275 










280 










285 








') O I 

X 


Cys 


Ala 


Asn 


Lys 


He 


Gly 


He 


Val 


Pro 


Gin 


Asp 


His 


Val 


Phe 


Phe 


Ser 


"1 "> o 

£j 




290 










295 










300 










TO-) 

J^l 4_t 


Gly 


Glu 


Gly 


Lys 


Thr 


Pro 


Tyr 


His 


Pro 


Cys 


He 


Tyr 


Glu 


Lys 


Met 


Met 


224 


305 










310 










315 










320 


*j ij j 


Thr 


Ser 


His 


Gly 


His 


Leu 


Glu 


Asp 


Leu 


Gin 


Asp 


Leu 


Trp 


Lys 


Asn 


Ala 


226 










325 










330 










335 




227 


Thr 


Asp 


Pro 


Lys 


Val 


Lys 


Thr 


He 


Ser 


Lys 


Gly 


Phe 


Phe 


Gly 


Gin 


He 


228 








340 










345 










350 






") Q 

^ ^ J 


Tyr 


Cys 


Arg 


Leu 


Met 


Lys 


He 


He 


Leu 


Leu 


Cys 


Lys 


He 


Ser 


Tyr 


Val 


230 






355 










360 










365 
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Asp 


Thr 


Tyr 


Pro 


Cys 


Arg 


Ala 


Ala 


tr lie 


X X e 
















■~> ^ ") 




370 










375 






















4i J J 


<210> SEQ ID NO 


: 3 




























<211> LENGTH: 1134 


























-) t 7 


<212> TYPE: 


DNA 




























9 1 P 
Z J o 


<213> ORGANISM: 


Homo sapiens 




















^ 4 w 


<220> FEATURE : 




























9 A 1 

ill 


<221> NAME/KEY: 


CDS 


























"> A 9 


<222> LOCATION: 


( l ) 


. . . (1134 ) 






















<400> SEQUENCE: 


3 


























*i -J 


a t n 
a uy 


Qua 


P\ t (T 

a Ly 


f fry 

l uy 


att 


acrt 


era c 
y y ^ 


aaa 


Pi n f\ 
a y a 


y it 


A P\ F\ 

aaa 


pi 3 pi 

aaa 


uy y 




tta 

u u a 


att 

a L- u 


4 8 




11C u 


ni y 


1*1 e L 


j_i e u 


Va 1 




V 


A rcr 


rt x y 


V d l 


ij y o 


ijy ir> 


T rn 
lip 


m n 

\J X 11 


Toil 

x e u. 


x x e 






1 








S 










i n 

X u 
















_ 4 ^7 


^ t t 
d U L 


tdy 


tta 
L Lu 


ttt 

L L L 


art 


act 


tat 


ttt 


1^ 1" 3 
u L_a 


r~i rr 

y ty 


a y t 


t L t 


p\ Y cj 
a uy 


u u u 


ttt 
u u u 


t rr rr 
Ly y 


y \j 


^ DU 


T 1 O 
lie 


C, 1 n 


T 1 1 

Xie u 


r lie 


A 1 a 

r\ i a. 


J. 1 i X. 




Ph p 


Lcli 


/Via 


Cat" 
OeX 


lie U 


rlc L 


r^lle 


Ph 0 
rile 


lip 




TCI 

Z J 1 








20 










Z D 


















n c; -i 
« J j 


rr P\ Pi 
yaa 


U- U- CI 


a L- U- 


rr a t 


a rl t 


eac 


att 


ata 
y •-y 


a y iw- 


f* Ft I - 
t a u 


a uy 


3 3 ry 
a a y 


1~ 3 

LU U 


tat 
ua u 


t rt 
u 0 u 


tap 


1 4 A 

_L T T 




flu 
t X U 


P rn 


lie 


/A O p 


J on 
noil 


Hi ^ 

11 _1_ 


lie 


Val 


oe X 


Hie; 
ula 


l v le U 




Del 


1 yx 


O e 1 


T V T 

1 y x 




Z J 9 
















AO 










A ^ 










t c 7 
J / 


dya 


"J- 3 /~i 


P* "t~ o 

ttt 


"3 T 

d Ld 


a 3 t 


ay t 


tat 


CI ri H 




rr +* rr 

g Ly 


a ^ f 

a a u 


rra f 

y d L 


att 


f~* t CT 

t uy 


t nt 


nf t 

L L L 


X ^7 Z 


Q 


Arg 


i yr 


l_y tr U 


X X e 


Acn 


Cp r 
Del 


T VT 


Acn 


n Vi /-^ 


v a x 


as n 


ASp 


T Vi r 
1 III 


ijeu 


Otil 


T d 1 1 

Leu. 




^> ^ Q 

~ d y 




so 










S S 










D u 














day 


p» Pt P 1 

tat 


3 pi P* 

att 


"t~ pi 

Ltd 


rr r 1 rr 
y ty 


y y y 


cct 


can 


Lat 


C* 3 ^ 

t a a 


1~ ^ 
Lat 


l Ly 


af f 

a u u 


aap 

aat 


f p> c 

t a L- 


a a rr 
a a y 


? A 0 


zo z 


i_iy o 


o 1 o 


I 11X 


C o 

ocX 


x a. 




P "TO 
r 1 U 


A, -pry 
zt. 1 y 


lyr 


Li i n 


lyr 


Leu 


lie 


Asn 


nib 


Lys 






u ~> 










70 










/ 3 










ft 0 




9 6 S 

Z U J 


era 3 
ydd 


day 


-r pr "r 

uy l 


tda 


(7 r t" 

M l_ 


CI d 


era r 1 

M U \^ 


rrt r 1 


t Lt 


L L L 


u ua 


nf n 

l u y 


ttt 

U U L 


y ua 


aaa 


a 

a 0 i_ 


2 8 8 

ii u u 


0 £ £ 
ZOO 


Glu 


Lys 


Cys 


Gin 


A 1 ^ 
r\ X a. 


VJ X 11 




v a i 


ije U 


T q i i 

Xie U 


Xifc! U 


T All 


Ph 0 
rile 


V d X 


T \T c 

i_iy 0 


1 J1X 




ZD/ 










ft S 










q n 
y u 










Q ^ 

y d 






£ Q 


get 


cct 


gaa 


aac 


a u 


gat 


cga 


cgt 


LLC 


rr r"T ^ 

gga 


d L L 


d y d 


agg 


aty 


T_gg 


ggc 


7 ^ 

J O U 


J / u 


Ala 


Pro 


Glu 


Asn 


iyi 


Asp 


Arg 


Arg 


O /-\ 

ber 


u xy 


lie 


Arg 


TV "v* 1 /t 

Arg 


1 nr 


I rp 


\j iy 




/ 1 








100 










i nc 

IUj 










11U 








Z / J 


aat 


gaa 


aat 


tat 


gtt 


egg 


tct 


cag 


r~\ \- r~T 

t Ly 


a a f 

a a L. 


fr ^ 

y 1 1 


aat 


a f n 

a Lt 


Pi Pi Pi 

aaa 


a t u 


r 1 1 rr 
t uy 


^ ft A 




Asn 


Glu 


Asn 


Tyr 


Val 


Arg 


Ser 


Gin 


ijeu 


as n 


A 1 a 

Ala 


as n 


ixe 


Liys 


1 nr 


Leu 




T7C 






115 










120 










1 oq 
X Z D 










9 7 7 
Z / / 


ttt 


gec 


tta 


gga 


act 


cct 


aat 


cca 




gag 




y d d 


y d d 


-\- -3 


t dd 


^i rr Pi 

d y d 


A ^ 9 
4 J z 


17Q 

Z / O 


Phe 


Ala 


Leu 


Gly 


Thr 


Pro 


Asn 


Pro 


Lieu 


Lj 1 u 


kj iy 


LjIU 


til u 


Leu 


p 1 n 

bin 


Arg 




279 




130 










135 










140 












281 


aaa 


ctg 


get 


tgg 


gaa 


gat 


caa 


agg 


tac 


aat 


gat 


ata 


att 


cag 


caa 


gac 


480 


282 


Lys 


Leu 


Ala 


Trp 


Glu 


Asp 


Gin 


Arg 


Tyr 


Asn 


Asp 


He 


He 


Gin 


Gin 


Asp 




283 


145 










150 










155 










160 




285 


ttt 


gtt 


gat 


tct 


ttc 


tac 


aat 


ctt 


act 


ctg 


aaa 


tta 


ctt 


atg 


cag 


ttc 


528 


286 


Phe 


Val 


Asp 


Ser 


Phe 


Tyr 


Asn 


Leu 


Thr 


Leu 


Lys 


Leu 


Leu 


Met 


Gin 


Phe 




287 










165 










170 










175 






289 


agt 


tgg 


gca 


aat 


acc 


tat 


tgt 


cca 


cat 


gec 


aaa 


ttt 


ctt 


atg 


act 


get 


576 


290 


Ser 


Trp 


Ala 


Asn 


Thr 


Tyr 


Cys 


Pro 


His 


Ala 


Lys 


Phe 


Leu 


Met 


Thr 


Ala 




291 








180 










185 










190 








293 


gat 


gat 


gac 


ata 


ttt 


att 


cac 


atg 


cca 


aat 


ctg 


att 


gag 


tac 


ctt 


caa 


624 


294 


Asp 


Asp 


Asp 


lie 


Phe 


lie 


His 


Met 


Pro 


Asn 


Leu 


He 


Glu 


Tyr 


Leu 


Gin 




295 






195 










200 










205 
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